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Background of the Invention 

Retroviruses are a group of viruses whose genetic material consists of single-stranded RNA. Following 
adsorption and entry of the retroviral RNA into the host cell, the viral RNA serves as a template for the 
synthesis of a complementary DNA strand. The DNA is then made double-stranded through the action of an 
enzyme having DNA polymerase activity; it is this double-stranded DNA which integrates into the host 
genome. The RNA-directed DNA polymerase activity responsible for the synthesis of complementary DNA 
from the viral RNA template is commonly called reverse transcriptase. 

Retroviruses are of particular interest because a number of retroviruses have been implicated as the 
causative agents of various cancers, and other diseases. A retrovirus, human immunodeficiency virus, is the 
causal agent of acquired immunodeficiency syndrome (AIDS). Additionally, the reverse transcriptase 
enzymes themselves have become important reagents in molecular biology because of their ability to make 
complementary DNA from almost any RNA template. Thus, reverse transcriptase is commonly used to 
make nucleic acids for hybridization probes and to convert single-stranded RNA into a double-stranded 
DNA for subsequent cloning and expression. 

Recently, reverse transcriptases have been used as a component of transcription-based amplification 
systems These systems amplify RNA and DNA target sequences up to 1 trillion fold. See e^g,, Burg eta|., 
PCT Patent Application WO 89/01050 (1988); Gingeras etaL, PCT Patent Application WO 88/10315 (1988); 
Davey and Malek. European Patent Application EPO 0329822 (1988); Gingeras et_al, European Patent 
Application EPO 0373960 (1989); Malek and Davey. PCT Patent Application WO 91/02814 (1989); Kacian 
and Fultz, European Patent Application EPO 0408295 A2 (1990). All of these references are hereby 
incorporated by reference into this disclosure. 

Some of the transcription-based amplification methods are exceptionally convenient since the amplifica- 
tion reaction according to these methods is isothermal. Thus, these systems are particularly suited for 
routine clinical laboratory use in diagnostic tests. Detection of pathogens causing infectious diseases and 
gene sequences associated with cancers or genetic diseases are among the most important uses of such 
tests Reverse transcriptases are also employed as an initial step in some protocols when the polymerase 
chain reaction (PCR) is used to amplify an RNA target. See Malek eta., U.S. Patent No. 5,130,238 (1992); 
and Mocharla et al., Gene 99:271-275 (1990). In such "RT-PCR" procedures, the reverse transcriptase is 
used to make irTinitial complementary DNA (cDNA) copy of the RNA target, which .s then amplified by 
successive rounds of DNA replication. 

The retroviral reverse transcriptases have three enzymatic activities: a RNA-directed DNA polymerase 
activity a DNA-directed DNA polymerase activity, and an RNAse H activity. See Verma, The Reverse 
Transcriptase, Biochim. Biophys. Acta 473: 1-38 (1977). The latter activity specifically degrades RNA 
contained in a n RNA:DNA duplex. Degradation of the RNA strand of RNA:DNA intermediates by RNAse H 
is an important component of some transcription-based amplification systems and is to be distinguished 
from unwanted degradation due to contaminating nucleases, which interferes with amplification. 

A disadvantage of the transcription-based amplification systems is their sensitivity to even trace 
amounts of nucleases. Since a number of important diseases may yield samples containing very few target 
nucleic acid molecules, the detection of small amounts of the target is often crucial for an accurate and 
timely diagnosis. Indeed, the value of target amplification methods is most important when the number of 
target molecules is low. At low input levels of the target nucleic acids, unwanted degradation of RNA targets 
or RNA or DNA reaction intermediates can lead to amplification failures and consequent misdiagnosis. 
Ribonuclease contamination is also a problem in RT-PCR reactions, since loss of the RNA target can lead 
to amplification failure. 

Ribonucleases are relatively ubiquitous, and, in particular, are found in high concentrations in a variety 
of biological materials, including preparations of retroviruses and in cells commonly used to express 
recombinant proteins. Ribonucleases frequently contaminate reverse transcriptase preparations from a 
variety of sources and have been reported to interfere with synthesis of cDNAs, preparation of probes, and 
other uses besides target amplification alone. Often, an RNase inhibitor is included in the reaction to 
minimize the deleterious effects of this contamination. See ag., Maniatis et_al., Molecular Cloning: A 
Laboratory Manual 8.11-8.13 (2d ed. Cold Spring Harbor Laboratory Press 1989), hereby incorporated by 
reference herein. . 

However a number of substances commonly used to inhibit or inactivate RNAses, including detergents, 
chaotropes, organics, metals, proteases and metals are inappropriate for use in target amplification systems 
since they will inhibit the enzymes used for amplification as well. RNAse-inhibiting proteins such as human 
placental RNAse inhibitor, Blackburn et_aL, J. Biol. Chem. 252: 5904 (1977) or rat liver RNAse inhibitor, 
Gribnau et al., Arch. Biochem. Biophys. 130: 48-52 (1969), may be unstable, are expensive, and can 
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contribute additional interfering substances such as nucleic acids and RNAses that are not inhibited by the 
inhibitor. 

In addition to nucleases, traces of other enzymes, nucleic acids, and certain buffer salts may interfere 
with amplification reactions. While these substances are merely undesirable for many uses of reverse 
transcriptase, because of the nature of the amplification reaction it is critical that the enzyme preparation 
contain as low an amount of them as possible. 

Isolation and purification of reverse transcriptase from various sources have been reported. In cases 
where the enzyme is isolated directly from virus particles, cells, or tissues, the cost is too high for 
widespread commercial use in diagnostic tests. See e^g., Kacian et al. , Biochim. Biophys. Acta 46: 365-83 
(1971)- Yang et al., Biochem. Biophys. Res. Comm. 47: 505-11 (1972); Gerard, et_aL, J. Virol. 15: 785-97 
(1975); Liu et al., Arch. Virol. 55 187-200 (1977); Kato et_aj., J. Virol. Methods 9: 325-39 (1984); Luke, elaL 
Biochemistry 29: 1764-69 (1990); Le Grice et al. , J. Virol. 65: 7004-07 (1991). Additionally, these methods 
have not assured removal of substances that are significant inhibitors or contaminants that interfere with the 
use of reverse transcriptase for target amplification reactions. Another important consideration in the use of 
reverse transcriptases for a variety of purposes is the RNase H activity associated with the enzyme. The 
amount of RNase H activity and the way in which the RNase H activities work in coordination with the RNA- 
and DNA-dependent reverse transcriptase activities are important features affecting the utility of the enzyme 
for various purposes including transcription-based amplification systems. Too much or too little activity, the 
wrong kind of activity (such as non-specific RNases), or activities poorly coordinated with DNA synthesis 
can all lead to reduced performance in a particular application. Proper balance of the synthetic and 
degradative activities must be maintained; this is not only a function of the particular reverse transcriptase 
enzyme used, but also is dependent on the ability of the purification protocol to remove the RNA and/or 
DNA degrading activities. 

The cloning and expression of reverse transcriptases in bacterial hosts has been previously reported. 
Attempts to clone and express reverse transcriptase from avian myeloblastosis virus (AMV-RT) did not lead 
to production of significant amounts of the purified enzyme. This is apparently due to the fact that the AMV- 
RT consists of two polypeptide chains, the « and /3 chains, which must form a dimeric structure and 
undergo specific post-trans lational modifications in order to produce fully active enzyme. These same 
modifications do not occur when the gene is expressed in E. coli . 

By contrast to the avian viral RTs, many reverse transcriptases derived from mammalian viruses consist 
of only one polypeptide chain; efforts to clone and express these enzymes have been more successful. In 
particular, Goff et al. , U.S. Patent 4,943,531 (1990) and Kotewicz et_aL, U.S. Patent No. 5,017,492 have 
described methods for the purification of reverse transcriptase derived from Moloney Murine Leukemia 
Virus (MMLV-RT) and expressed in E. coli , which methods form the basis for the majority of commercial 

: reverse transcriptase preparations. 

Many commercial preparations of reverse transcriptase have been found unsuitable for use in target 
amplification and for other purposes due to nuclease contamination. See Sambrook, supra, previously 
incorporated by reference herein ; Ryskov et_aL. Mol. Biol. Rep. 8: 213-16 (1982). Other problems with 
commercial preparations of MMLV-RT may be related to an altered coordination between the DNA 

} synthesis and RNAse H activities of the purified enzyme, reduced ability to bind and initiate synthesis at 
primer sites or to read through regions of tight secondary structure, or alternately may be due to DNase 
and other protein contamination. See Agronovsky, A.A., Anal. Biochem. 203: 163-65 (1992). Additionally, 
commercial preparations made using the previously available methods for purification show significant lot- 
to-lot variability. 

5 Moreover, due in part to the lengthy and labor-intensive purifications employed, the expense of the 
reagents and equipment employed for scale-up and the low yields of enzyme, the cost of such enzymes is 
prohibitive for their widespread commercial application in target amplification systems. 

It is therefore an object of the present invention to provide an improved form of reverse transcriptase 
having the correct balance of DNA synthetic activities and RNAse H digestive activities, thereby being 

o particularly suited for use in nucleic acid amplification methods. 

It is another object of the present invention to provide a convenient source of reverse transcriptase 
containing low levels of contaminants, such as undesired RNAses, that interfere with transcription-based 
amplification reactions by cloning and expressing a gene coding for an MMLV-RT enzyme having these 
properties in an E. coli host. 

5 It is yet another object of the present invention to reduce the RNAse activity associated with the 
enzyme prior to and following purification by cloning and expressing the MMLV-RT gene in a ribonuclease- 
deficient strain of E. coli . 
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It is another object of the present invention to develop a simple purification scheme for the isolation of 
the e t n ^ y ^ urther object Qf the presept jnventiori t0 provide methods for the purification of the enzyme that 
achieve high levels of purity of RT at a low cost. 

Summary of the Invention 

The present invention features an expression vector or plasmid containing a cloned version of the gene 
for MMLV-RT which, when used to transform a suitable host cell such as E. coli , leads to the expression of 
the gene and the generation of a gene product having the DNA- and RNA-directed DNA polymerase 
activities and RNAse H activity associated with retroviral reverse transcriptases. 

The present invention also features a plasmid containing a gene for MMLV-RT inserted into a host cell 
which has a reduced level of ribonuclease activity as compared to wild-type strains. 

The present invention also includes methods for the purification of the resulting enzyme from the host 
cells, such methods comprising suitable growth media, fermentation conditions, harvesting and storage of 
the cells, cell lysis and chromatography. 

The present invention also features the enzyme produced by the expression vectors, host cells, and 
purification procedures of the present invention. The enzyme is highly-purified and suitable for use in 
nucleic acid amplification and other genetic engineering procedures. 

Finally the present invention features the use of the enzyme produced by the methods described 
herein for 'the synthesis of complementary DNA for a variety of purposes, notably in transcripfcon-based 
amplification and RT-PCR reactions. 

Brief Description of the Drawings 

Figure 1: Construction of plasmid pUC18N 

Figure 2: Oligonucleotides used to construct plasmid pUC18N. 

Figure 3: Alignments of the ribosome binding sites. 

Figure 4: Oligonucleotides used to modify the ribosome binding site and spacer region. 

Figure 5: Construction of plasmid pUC1 8 MMLV Sst-Hind. 

Figure 6: Construction of plasmid pUC18 MMLV III Tailed. 

Fiqure 7- Oligonucleotides used to construct plasmid pUC18 MMLV Tailed. 

Fiaure 8" Construction of plasmids pUC18N MMLV Gly and pUC18N MMLV Gly Tet(-). 

Fiqure 9" Construction of plasmids pUC18N SD9D MMLV Gly and pUC18N SD9D MMLV Gly Tet(-) 

Figure 10: Sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) gel photograph of 
P-1 1 and Sephacryl S-200 fractions of purified MMLV-RT. 

Detailed Description of the Invention 

Definitions 

As used herein the following terms have the indicated meanings unless expressly indicated otherwise. 

By "selectable marker gene" is meant a DNA fragment encoding a gene which, when carried and 
expressed by a host cell, is capable of conferring a growth advantage to that host cell as compared to cells 
not containing the selectable marker gene when both are grown in a culture media of a given composition. 
For example, the gene encoding 0-lactamase will confer resistance to amplicillin on host celts containing 
this gene, whereas cells not containing the gene will be sensitive to ampicillin; thus only cells expressing 
the gene for ^-lactamase will grow in media containing amplicillin. Similarly, cells unable to catabol.ze an 
essential amino acid will not grow in media not containing that amino acid, whereas cells containing a gene 
allowing the cell to make the essential amino acid will grow in the same media. 

A selectable marker gene may be covalently linked, for example in a plasmid or expression vector to 
one or more other gene or genetic element as a means of identifying cells containing both the selectable 
qene and the "silent" gene(s) and/or genetic element(s). 

By a "purified" nucleic acid or protein is meant a nucleic acid or protein subjected to at least one step 
which removes cellular components such as carbohydrates, lipids, unwanted nucleic acids, or unwanted 
proteins from the indicated nucleic acid or protein. 

By "upstream" is meant to the 5' side of a given locus on a nucleic acid strand, or in the case of a 
double stranded nucleic acid molecule, to the 5' side of a particular locus with respect to the direction of 
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nonn transection in that region of the nucleic acid molecule. 

9 B 'down S am" is meant to the 3' side of a given locus on a nucleic acid strand or ,n > the > case of a 
double stranded nucleic acid mo.ecule. to the 3' side of a particular locus wrth respect to the d.recfon of 
npnp transcription in that region of the nucleic acid molecule. 

9 By "l!" Tmeant the temperature at which 50% of a population of a ^ b '^f ""^J 
molecules, or nucleic acid molecules having a double-stranded region, become smgle-stranded or thermally 

By' ^recombinant- is meant that a nucleic acid molecule or protein is at least partially ther^tof^ 
vitro biochemical techniques. A "recombinant DNA molecule" is thus a non-naturally occurring molecule. 
Such ecomb in ant molecules include, but are not limited to molecules which compnse restriction en- 
donuc ease foments, in vitro nucleic acid ligation products, in vitro exonuclease fragments, and ex P ress,on 
vectors comprising heterologous genetic elements such as one or more of the following: promoters 
"preTsor genes ^selectable barker genes, temperature-sensitive DNA replication elements, structural 

9ene » S ReTomtrn a nt" proteins or enzymes are those not found in nature. These include purified protein 
P re P a?aSns and Pr oteins produced from recombinant DNA molecules. The latter proteins are usually 
expressed n a heterologous host cell. i.e.. one not native to the protein or enzyme ,n question^ However 
"e gene encoding a recombinant protein may reside on an expression vector contained w,th,n a host cel. of 
the same species as the organism from which the protein in question was denvecL 

By "truncated" is meant a smaller version of the gene or prote.n in quest on. with respect to the 
primary nucLt id e or amin0 acid sequence, a truncated form of a reference nucleic acid or prote.n ,s one 
that lacks one or more nucleotides or amino acids as compared to the reference molecule. 

By "substantial sequence homology" is meant that a first nucleic acid or protein^ J™ 
recognizably non-random similarity to a second reference nucleic acid or prote.n over at least about 89 /o of 

' " t~ t£^^~T^L at .east one definite region of contiguous nucleotide or 

amin B?"o%"ofre S p .icati 0 n" is meant a specific region of DNA at which primer production and initiation of 
DNA polymerase activity begins. In this specification, the term is used to mean a nucleic acid element 
o present or ™ J DNA expression vector that a..ows the expression vector to increase ,n copy number within a 

''"By^romoter" is meant a genetic element comprising a specific region of DNA at which , an RNA 
polymerase enzyme can bind and begin transcription of a DNA template thus Providing the firs step of 
translating the genetic information contained in the sequence of a nuc.e,c acd ,nto the production of a 
nrotein of an amino acid sequence corresponding to that nucleic acid sequence. 

P By CresTn", "gene expression" or "protein expression" is meant the products of proton from 



^eani^h^c;, method of inducing a host eel, to internalize a nuc.ic acKj 
molecule Such nucleic acid molecules are usually genetic elements compnsmg at least an ong.n of 
re3SSn.^le^ble marker gene, and a promoter for expression of the selectable marker gene withm 

h B 0 y St " C hetero.ogous" is meant not of the same species. Thus, an enzyme expressed in a heterologous 
host Mil is pmduced in a host cel. of a different species than the one from which the enzyme was origmally 

^ By "gene" is meant a nucleic acid region having a nucleotide sequence that encodes 
protein or polypeptide. A gene may comprise one or more "coding sequences" contammg codons that 
correspond to amino acid residues of the expressed protein; the gene may also compnse, but need not 
comprise one Tmore "non-coding" nucleotide sequence regions that do not contain codons correspond- 
ing to amino acid residues of the expressed protein. exD ression 
All of the biochemical techniques, used for construction and evaluate of the MMLV-RT expression 
vecto s inc uding, but not limited to. restriction digestion protocols, gel electrophoresis Southern blot and 
DNTmodif ca ion reactions, are known to those of ordinary skill in the art and are described in Sambrook ej 
al su^ra Srwas previously incorporated by reference herein. In addition, many of these techniques are 
IsSdTn the co-pending application of Riggs, et^L entitled Purified DNA Polymerase front Bacillus 
S^rnophilus which enjoys common ownership with and was filed the same day as this application. 
This application is hereby incorporated by reference herein. 
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I. Construction of the Cloning Vector 
a. Plasmid pUC18N 

Plasmid pUC18 (Life Technologies. Inc.. Bethesda. MD) was used as the parent vector. Clones were 
screened by restriction mapping techniques on agarose gels; such techn.ques are well known ,n the art. An 
Nco I restricted .site was introduced between the lacZ ribosome binding site and the Eco R. restriction site 
of picTs by making a substitution of two nucleotide bases, as shown in Figure ^ 
introduced using the'two synthetic o.igonuc.eotides shown in Figure 2 as o'^jf^ and 2 '. (SEC ID 

, 0 NO- 1 and 2). As shown, the oligonucleotides overlap by 30 base pa, s at heir 3 ends The 
ofgonuCeotides were al.owed to hybridize, filled in using the K.enow ^d w .t 

and dioested with Pvu II and Eco Rl. Plasmid pUC18 was digested with Eco Rl and partially digested with 
Pvu ?I to^ld Co DNA fragments: a larger fragment including the intact ampici.lin resistance gene Amp) 
the o iqin of replication (Ori), and part of the lac Z gene. The smaller Eco Rl-Pvu II fragment cons.sted of 

J5 Z portion , of tSe lac Z gene corresponding tuitions 450 to 628 of the P UC18 map. The synthe .c Eco 
R| P r fragment was inserted into the larger vector fragment, ligated and used to transform 
JM 109 Clones containing properly constructed vectors produced a blue color using an X-ga. substrate (5- 
b^omo 4-chloro-3-indo,y.-'-D-ga.actoside) as a substrate, indicating that the JacZ gene had bee" properly 
^constructed. These results were further verified by restriction mapping. Th.s vector was named P UC18N. 

20 (See Figure 1). 

b. Construction of Plasmids Containing the Reverse Transcriptase Gene. 

The intact MMLV gene was isolated as an Sst l-Hind III fragment from the pMMLV-L clone described in 
25 Miller and Ve ma, J. Wol. 49:214-222 (1984). This fragment contained the nuc.eot.de sequence conespond- 
Z to the region fToT^MLV position 2558 (Sst . site) to position 4894 (Hind l.l s.te) and con - 
entire RT gene between 40 extra upstream bases and 284 extra downstream bases. Plasm.d vector P UC18 
Agisted with Sst I and Hind Ml, and the vector and RT gene we. ^J^^IS 
transform competent E. coli DH5«f cells. The resulting plasm.d was named pUC 8 MMLV Ss t-H,nd ^(figure 
30 5) This plasmid waslheTdigested with Eco Rl and Bg. I, yielding a 2013 bp fragment of the -MMLV-RT 
qenetcking the terminal 3' sequences of the RT gene. The RT gene fragment was .gated at .ts Bg. I Is te 
S a doubistranded linker designed with Bgl l-Hind III overhangs (figure 7) from *J> «yn«J« 
oligonucleotides 8 and 9 (SEQ ID NOS-.17 and 18, respectively). The synthetic ^""^^^ 
sequences for the carboxyl terminus of MMLV reverse transcriptase and a stop code ,n ■ p ^'^ u ™ w " 
35 tested with Eco Rl and Hind III. and the large vector fragment was gel purified ^ ®^ ^ ^ 
reconstructed RT gene. The resulting plasmid was called P UC18 MMLV ... Tailed, and contained the MMLV 
gene with the extra 3' sequences removed. 

c. Construction of pUC18N MMLV Glv and P UC18N MMLV Gly Tet(-). 

The extraneous 5' sequences of the cloned RT gene were removed as follows^ A 1997 bp Mam .-Hind 
„, fragment was isolated from pUC18N MMLV III Tailed (Figure 8). This nuc.eic acd fragment conned the 
RT qene without the 5' twenty-three nucleotides of the MMLV-RT gene sequence. Two complementary 
SLnucUdes were synthesized and hybridized to recreate the 5' portion of the RT gene (but w.th 
, 5 nuSo^coding for a glycine in the second amino acid position and a Nco I 5" overhang contammg an 
initiation codon, as shown below. 

Oligonucleotide #3 (SEQ ID NO : 3 ) CATGGGTCTG AACATCGAAG 
50 ATGA 

Oligonucleotide #4 (SEQ ID NO:4) TCATCTTCGA TGTTCAGACC 
5 ' -CATGGGTCTGAACATCGAAGATGA-3 ' 
3 ' - CCAGACTTGTAGCTTCTACT - 5 ' 

55 

P,asmid DUC18N was digested with Nco I and Hind 111, and the smaller of the two ™ u ^^™^ 
was removed The hybridized oligonucleotides (SEQ NO:3 and 4) were ligated to the .arger P UC18N 



BNSDOCID: <EP 0688870A1_I_> 



EP 0 688 870 A1 



fragment at the Nco . site, and the 1992 bp MMLV-RT Mam l-Hind IH gene was then inserted as well, 
vieldina the expression vector pUC18N MMLV Gly (Figure 8). The Tet gene from pUC18 Tet( + ), 
cons ucted las described below was inserted at the Aat I. site, and the resulting , plasm.d was called 
P UC 8N MMLV Gly Tet(-). The minus sign refers to the orientation of the Tet gene wrthin « 

5 The cloned MMLV-RT of the present invention differs from the nat.ve enzyme ,n two , respecte. First the 

codon encoding the threonine residue which occupies position 1 of the nat.ve enzyme the second codon of 
the RT gene) has been replaced with a glycine codon in the cloned RT of the present mven ton .secondly 
the codons for the leucine, asparagine and isoleucine residues occupymg ammo acid ^.ons2 cjT C o6on 
the mature native protein sequence were replaced with codons more preferred by Bcoli. The CTA codon 

,o codi^foTreuc'e was replaced with degenerate codon CTG; the AAT codon coding for asparag.ne was 
placed with degenerate codon AAC. and the ATA codon coding was replaced w,th degenerate codon 
ATC. (See Wada. K. et_aL, Nucl. Acids Res. 19(supp.):1 981 -1986 (1991)). 

d. Construction of Plasmid pUC18N SD9D 

In order to optimize the expression of cloned MMLV-RT, the lac_Z ribosome binding site (RBS) of 
PUC18N wis modified to contain 9 bases complementary to E^oh 16S rRNA rather than on the 4 such 
bases present ,„ the pUC18 parent vector. At the same time, plasmids were constructed having space 
reoTon ^s^ngfte RBS and the ATG initiation codon by either 7, 8. or 9 base pairs, as shown for one of 
20 S^STJS. e 3 as SD7, SD8 and SD9. Common elements in the design of these spacer sequences 
we^S^e (A) in the third position 5' to the ATG initiation codon, 2) no guan.ne (G) 
Tn he spacer region except in the Nco I site, and 3) an 5'-RRTTTRR-3' sequence spann.no the RBS and 
he spacer, where T is thymine and R is a purine nucleotide (adenine or 9"™*^ comma -laments 
for heterologous gene expression were suggested in Jay et_aL. Proc. Natl. Acad. Sc.. USA 78. 5543 48 
pi HQR-n anH .Iftspers et al- Pr otein Engineering 4: 485-92 (1991). 

( T e oligonucleotidis" used to introduce these modifications are shown in Figure ^ 
and 7 (SEQ ID NO:5. 6 and 7) were each used in conjunction with ohgonucleotide 1 _sh ,own ■ F,gure 2 The 
nucleotide 4 bases on the 5' side of the ATG start codon of « * ^T si nce 

5' side by the ATG start codon in oligonucleotide 7 were synthes.zed wrth a rmxture S '™° 
3 o neither was theoretically preferred. See Jespers. etaL, supra As in the construction of pUCIBN. 3C I base 
pair ^^re g r 0 n of cornpiementarity exisled between oligonucleotide 1 and each of o ,gonucleot,des 3, 4 and 5 
As befo e each pa" of oligonucleotides was allowed to hybridize, was filled i in us.ng the Wenow ^fragment * 
E coli DNA polymerase I. digested with Pvu II and Eco R. and inserted mto the same la ge ^^CJ8 Pvu I. 
EccTri fragment used in constructing P UC18N. The MMLV-RT gene was then cloned ,nto th.s vector as a 
" ^es^^ *■ -e,s of MMLV-RT expression. The cells containing 

the p'asmid wTthe 9 base spacer (SD9; oligonucleotide 7) displayed the highest level of reverse 
t^nsSta e expression. The plasmid was isolated and sequenced both of ^ d ^ te _^^ 
and 5 bases on the 5' side of the ATG start codon were found to be adenosme (A) residues. The expression 
40 vector was named pUC18N SD9D. 

e. Insertion of the Tetracycline Resistance Gene 

The ampicillin resistance (^-lactamase) gene of pUC18 was used as a genetic selection marker in the 
45 early vector constructions. However, owing to the fact that ^-lactamase acts to destroy the arfitao c 
relaLely quickly, there may be a sizable plasmid-minus revertant population m a culture .n which amp.c.ll.n 

i5 VZ::T^lZ^e the cel. population in the cultures, the vector was modified to contain a 
tetracyct r si ance gene. Because tetracycline acts to block cellular uptake of the ; ant^t-c rather than 

50 Sating it, the culture should be more stable in the presence of tetracychne than ^ «™^ n - 

The tetracycline resistance gene was isolated from pBR322 as a 1427 bp Eco Rl-Ava I fragment. The 
sin 8 rstn^nTangs were fiL in using the Klenow fragment of Bcoji DNA polymerase L y.e.d^ a 
blunt ended fragment Aat II linkers were ligated to the tetracycline resistance gene fragment and digested 
wl Aat II Plasmid pUC18 was digested with Aat II, and the linearized vector was l.gated to the Aat I 

55 Sment obtaining tfe tetracycline Li stance gene. The ligation mixture was used to 

E coli JM109 cells, and the transformants were selected by tetracychne resistance. The structure of the 
la^d was verified by restriction mapping. Cones were selected having the tetracychne res.stance gene 
inserted in both orientations; the plasmids were named pUC Tet( + ) and pUC Tet(-). 
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The two plasmids were used as a supply of the tetracycline resistance gene (Tet fo ' '^°" s 
Dlasmids containing cloned MMLV reverse transcriptase. This approach was preferable to *emphng to 
insert the Averse transcriptase 9 ene(RT) into a vector already containing the Tet gene, since the Tet gene 
co^ns relSn Stes for enz'ymes used in the reverse transcriptase Coning, while the RT gene contams 
5 no Aat II sites. 

f Construction of pUC1 8N SD9D MMLV Gly and DUC18N SD9D MMLV Gly Tet(-). 

The intact modified reverse transcriptase gene from P UC18N MMLV Gly Tet(-) was isolated as a 2018 

,o bp Nco « S mCment and ligated with vector P UC18N SD9D from which the Nco l-H.nd 1 porker 
lion hid been removed. The resulting plasmid, called pUC18N SD9D MMLV Gly, contained the MMLV- 
RT 9 gene modmed in the three ways described above in addition to having the improve ^--e b-nd-ng 
stte and spacer region, as described above. This plasmid was cleaved at .ts un.que Aat II site , and the Aat I 
Tet aene ^fragment from pUC18 Tet< + > was inserted into the vector and ligated. Plasm.ds contam.ng the Tet 

, 5 ofne insert were"solated in both possible orientations, and the level of RT expression was tested for clones 
Sonta inTng e^h P .asmid. The Cone having the Tet gene in the (-) orientation (with the coding strand in the 
samronentation as MMLV-RT) was found to produce higher levels of RT than the clone having the Tet 
gene in the opposing orientation and was therefore chosen as the preferred clone. 

20 ||. Selection of the Host Cell Strain 

The following E. co!i strains were tested for expression and purification of MMLV-RT: JM109, DHB-P. 

activitv assavs on crude cell lysates. Of the RNase I deficient strains, E. coli 1200 (Yale University tc^ 
30 G Tnl Sock Center, Strain^) consistently showed high levels of enzyme expression using these 
assays; unless indicated otherwise, all experiments described herein were conducted us.ng this stram. 

III. Growth of E. coli 1200 containing pUC18N SD9D MMLV Gly Tet (-) 

The fermentation culture medium (A-Z Amine media) contained the following components in a volume 



N-Z Amine A (Sheffield Products, Norwich, N.Y.) 

Yeast Extract (Difco) 

NaCI 

NaOH 

Tetracycline (12 mg/ml in 70% ethanol) 



2 kg 
1 kg 
1 kg 



The mixture was autoclaved in the fermentation vessel at 121 -C for 20 minutes, then allowed to cool. 
Thp tetracycline was added when the temperature reached 37 • C. 

Thr'noclm of E. coli 1200 containing pUC18N SD9D MMLV Gly Tet(-) was prepared by inoculating 2 
ml ol N-fAm ne plus-mig/m. tetracycline (LB + Tet) with a frozen stock culture of the vector^nmfl 
Hn and fnTuba^g overnight at 37-C with shaking. The resulting 2 ml culture was then used to mocu.ate 
?n one-liter cultures which were again incubated overnight at 37 • C with shaking. 

^J^h^SnwL was then inoculated with 20 liters of seed culture, and the cells were allowed to 

fermentation, and rose during that time to approximately 8.2. c ham . es centrifuae 

The culture was chilled to 20 • C. and the cells were collected by centnfugation in a Sharpies centnfuge_ 
The eel were not washed. The cell paste was divided into 200 g portions and frozen in hqu.d N 2 . During 
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freezing, the cell mass was broken into smaller pieces to ensure rapid and thorough freezing. The frozen 
cell paste was then stored at -70 * C. 

IV. Purification of MMLV Reverse Transcriptase from E. coli 1200/pU C18 N SD9D MMLV Gly Tet(-) 

1. Assay of Reverse Transcriptase Activity and Protein C oncentration. 

Methods for assaying reverse transcriptase activity are known in the art. For the work described here, 
the dT^A assay described by Kacian was used (Kacian, Methods for Assaying Reverse Transcriptase, in 
Methods in WQlogy (Academic Press 1977). herein incorporated by reference as part of this disclosure^ 
bnTunftofre ^transcriptase activity converts 1 nmole of dTTP to acid-prec, P ,table form ,n 10 minutes 
under the conditions described therein. 

2. Cell Lysis 

Eleven hundred grams of frozen cell pastes were broken into pieces and suspended in 3.3 liters of 
Lysis Buffer (25 1 Tris-HCI (pH 7.5), 10 mM ethylenediamine tetraacetic acid (EDTA), 10% (vA,) g.ycero 
5 mM dithiolthreitol (DTT), 1% (v/v) Triton X-100, 10 mM NaCI. 1 mM phenylmethylsufonyl I fluor.de (PMSF)) 
by s^ng a 4-C The cells were then lysed by 2 passes through an APV Gaulin ^MR homogen.zer a a 
cLtinuous pressure of 8,000 psi. The receiving vessel was kept in an .ce water bath and he .nftal 
ZToaZxe ^s ZLe* o chiS for 30 minutes prior to the second pass. The lysate was then cleared by 
SlSSn - 4,500 x g for 1 hour at 4-C. and the pellet was discarded. The c.anfed lysate was e,ther 
used immediately or stored frozen at -70 'C and brought to 4-C before use. 

3. Phosphocellulose Column Chromatography 

Phosphocellulose (Whatman P11. 100 g) was treated with 2.5 liters of 0.5 ^ ^J^^^ 
liters of 0 5 N HCI, as recommended by the manufacturer. After a f.nal water wash, the phosphocellulose 
waTsuspended in 1 0 I of 1.0 M Tris-HCI (pH 7.5), allowed to stand for 5-10 minutes, and transferred to a 
BuchnerTunneT The buffer was removed by vacuum filtration, and the phosphocellulose was washed with 
?0 M Ths HCI (pH 7.5) until the P H of the effluent matched the pH of the ^<J- ?J 

phospUelkTse was transferred to a beaker and suspended in 1.0 I of column buffer 25 mM Tns-HC. (pH 
7 5) 1 mM EDTA 10% (v/v) glycerol. 1 mM DTT. 0.1% (v/v) Triton X-100 and 1 mM PMSF) containing 0.05 
M NaC™ After 5-1 0 minutes the buffer was removed under vacuum filtration as described above^ The 
pTspLe^ose was "hen suspended in 700 m. co.umn buffer containing 0.05 » 

All subsequent steps were carried out at 4'C. The chromatography was earned outusmg Pharmaaa 
FPLC equtment A Pharmacia XK 50/30 (5.0 cm x 26.0 cm) was packed ^^J"^^ 
nhosohocenulose to give a bed of 500 ml. The column was then washed with 1 I of column buffer 
SSnS ?^M N& at a flow rate of 60 m./hour. Column adapters (Pharmacia AK 50) were used to 
SnS3i"^S vo ume at the ends of the column. Six hundred ml of clarified cel. lysate wer e applied to 
L Tdumn at a flow rate of 30 mi/hour. The column was then washed with 650 ml of column buffer 
containing 02 M NaC. at the same flow rate. Because of shrinkage of the co.umn bed. excess buffer was 
™ed from the space above the co.umn bed, and the top flow adapter was readjusted to maintam 

^r—^^with a 1500 m, linear sa,t gradient, from 0.2 M NaC, to 0.7 "™**£>% 
buffer at 30 ml/hour. The effluent was monitored for the presence of protein by .ts absorbance a 280 nm. 
FrSons of 25 ml were collected except during the elution of the protein peak, during wh.ch 15 m. fracbon. 

TnTSumn fractions were analyzed using SDS-po.yacrylamide gel e.ectrophoresis (SDS-PAGE) fo.- 
■ .owed by SLssie Bri.Nant B.ue staining. SDS-PAGE is we., known in the art and ,s desenbec Mr, 
Laemmli U K Nature 227:680 (1970), which is hereby incorporated by reference herein. Ten m'crolrtBrs 
fro* Teach fracture analyzed in each gel lane. A control .ane contained a known amount of punfied 
MMLV r^T Those fractions containing a significant amount of protein migrating with an apparent molecular 
^Z'l,^^c7X^ of the MMLV-RT control and which contained little visible contaminating 
- protein bTnds were pooled. Approximate* 95% of the protein eluting within the major prolan peak was able 
' ote pooTed 'Z L P including significant amount of contaminating ^S^Z^^S 
used to locate and pool the peak MMLV-RT enzyme fractions; such procedures are known to those of 
ordinary skill in the art. 
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4. Sephacryl S200 Gel Filtration 

The pooled phosphocellulose fractions, having a volume of 80-100 ml, were concentrated to less than 
25 ml by ultrafiltration in an Amicon ultrafiltration cell using an Amicon P30 membrane at 20 ps. of ^nrtrogen 
Two 26 cl x 94 cm Pharmacia XK 26/100 columns were packed with Sephacryl S200 (Pharmacia) 
according to the manufacturer's directions. Column adapters were used to minimize the dead volume. Both 
columns were connected in series. The columns were washed with 2 . column buffer conta.n.ng 0^2 M NaCI 
STflow rate of 90 ml/hour. The concentrated phosphocellulose Pool (about 25 ml) was loaded onto the 
upstream column, and the column was developed with the same buffer at a flow rate of 90 ml/hour Again 
he effluent was monitored for its absorbance at 280 nm; the initial 200 ml of eff ^^^^ 
single pool, and 4 ml fractions were collected during the elution of the prote.n peak. The MMLV-RT eluted 
when approximately 290-300 ml of buffer had been applied to the columns. 

The fractions were again analyzed using SDS-PAGE as above. Three microliters from each fraction m 
the peak region were runln each ge. lane; as before, a control lane contained purified MMLV-R a ^nown 
mass. Those fractions containing a significant amount of protein migrating with the purified MMLV-RT and 
which contained little visible contaminating protein were pooled. Preferably, those fractions containing 
predominant bands of a higher apparent molecular weight than the MMLV-RT were not included « thepooL 
Between 95-98% of the protein in the major S200 peak was included ,n the Pool. Although assays for 
reverse transcriptase activity may be used to locate and identify the MMLV-RT ,n the fractions analysis 
oreferably includes SDS-PAGE to avoid including higher molecular weight contaminants in the pool. 

The pooled S200 fractions are sufficiently concentrated for most uses. The enzyme can be stored in 
50% glycerol at -20 * C. 

Example 1: Expression of MMLV-RT by E. coli co n taining pUC18N MMLV Gly Tet(-) or pUC18N MMLV Gly 
Tet(-) with a Modified Ribosome Binding Site and Spacer Sequences of Different Lengths 

The MMLV-RT gene containing the glycine amino acid substitution in the first position was evaluated in 
vector pUC18N and P UC18N with the spacers and modified ribosome binding site described above. All 
vectors contained the Tet gene and were evaluated in E. coli strain 1200. 

Fifty ml cultures of E coli 1200 containing either of these two contracts were grown for 16.5 hours at 
37 -C with shaking. Aliqu^f 0.5 ml were harvested, centrifuged for 2 minutes in a n^^>' 
the supernatants were discarded. The cell pellets were resuspended on 0.5 ml of a wash buffer (50 mM 
Tris-HCI (pH 8.0), 10 mM NaCI, 5 mM EDTA and 0.25 M sucrose) and then 

pellets were frozen at -80 -C and then resuspended in 200 ul of lysis buffer (10 mM Tris-HCI (pH 8.0), 10 
mM NaCI, 1 mM EDTA. 1% glycerol. 5 mM DTT. 0.2 mM PMSF and 100 ug/ml lysozyme) and left on ce 
for 20 minutes. One hundred microliters of 0.75% (v/v) Triton X-100 was added to each sample, and the 
m xture was frozen and thawed twice. The lysate was cleared by centrifugation and to a. prote.n was 
assayed by the method of Read and Northcote (Anal. Biochem. 116:53-64 (1981)). the disclosure of wh.ch 

is in A s ?hVr; e a ; n ~aye d * «*« ****** ^ ^ * rr ^^ 

action each clone was calculated in terms of units per microgram of total protein ,n the lysate as well as 
un s per ml of bacteria, culture. The results shown in Table 1 indicate that the < vector <^™"Q ^ 
modified ribosome binding site (RBS) and the 9 base spacer sequence expressed the highest levels of 



Comparison of RT Expression in Different Plasmid Constructs 


Expression Vector 


RT activity; U per 


RT activity; U per ml culture 


u,g total protein 




unmodified pUC18N 


1.81 


746 


pUC18N with 7 base spacer and improved RBS 


2.25 


823 


pUC18N with 8 base spacer and improved RBS 


1.72 


679 


pUC18N with 9 base spacer and improved RBS 


2.69 


1,036 



10 
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Example 2: rv^psrisnn of Modified MMLV-RT in E. coli 1200 and JM 109 Host Strains 

Plasmid pUC18N was used to create plasmids encoding MMLV-RT with glycine alanine, or valine 
substitutes in the first native amino acid position. These substitutions were created us.ng ol.gonucleot.des 
m To gos 3 a nd 4, but with a codon of sequence 5--GTT-3' or 5--GCT-3'(coding for val.ne or Canine 
eToect vely) in the second position of the RT gene, following the initiator, codon The Tet gene from 
pJSa Te (I was inserted L the resulting plasmids in each orientation for compar.son. These p.asm.ds 
wenl usld to reform E. coli JM109 host cells which contain an episomal copy of the lac repressor lac «« 
TZ llV ransfoVrn^t-celiTwere grown overnight as in Example 1, except when the cells reached log 
9 p e g owS the ^0"^ induced by the addition of 0.5 mM (IPTG) for approximately 22 hour, 
Sauots wtre halted and assayed for reverse transcriptase activity as in Example 1. The results are 
sSn^ieTSS. As can be seen, the G.y Tet(-) construction showed the highest level of enzyme 
expression. 

Table 2 



Effect of Orientation of Tet gene on RT Activity 




RT Activity; U per ug Total Protein 


RT Activity; U per ml Culture 


Gly Tet( + ) 


0.44 


177 


Gly Tet (-) 


1.29 


472 


Ala Tet( + ) 


0.59 


229 


Ala Tet (-) 


0.54 


243 


Val Tet( + ) 


0.91 


400 


Val Tet (-) 


1.03 


395 



In a separate experiment, the Gly Tet(-) and the Val Tet(-) constructs were evaluated in E^coli hosts 
1 200 and JNM 09 The JM 109 cultures were induced as above, while the 1200 cultures were un.nduced 
The resul shown below in Table 3 indicate that the levels of expression in both strains are comparable for 
the Gly substituted MMLV-RT. and higher in strain 1200 for the Val subsftuted plasm.d. 

Table 3 



Comparison of RT Expression in Different Host Cell Strains 




RT Activity; U per ug Total Protein 


RT Activity; U per ml Culture 


1200/Gly Tet(-) 


1.04 


591 


JM 109/Gly Tet (-) 


1.05 


533 


1200/Val Tet(-) 


1.00 


516 


JM 109/Val Tet (-) 


0.61 


357 



Example 3: Growth of E. coli 1200/pUC18N SD9D M MLV Glv Tet(-) and Expression of MMLV-RT 

One liter of growth medium contained 10 g of N-Z Amine A, 5 g of yeast extract, 5 0 of NaCI. andO.1 
ml of 10 N NaOH. One mil.iliter of 12 mo/ml tetracycline in 70% ethanol was added to the cooled. 

"mlTftTdium was inoculated from a frozen stock culture of the E^coli *«f^ ™» ^ 
55 allowed to grow overnight with shaking at 37 • C. The two ml bacterial culture was used to inoculate 500 ml 
o medium and this culture was grown overnight as above. The 500 ml culture was. in turn used to 
inocu^e "'.iters of medi m in a New Brunswick BioF.o II. fermenter. The culture was grown at 37'C w-th 
stirring at 350 RPM . The culture was sparged with air at 4 liters/minute during fermentafor, Five to ten 
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samples were taken every hour for measurement of pH, optical density, protein 
transcriptase activity. These results are shown in Table 4 below. 

Table 4 



Growth Kinetics of 1200/pUC18N SD9D MMLV Gly Te 


(-) 


Sample 


Time(hr) 


pH 


A600 


Protein (mg/ml) 


RT Activity; U/Assay 


RT Activity; U/mg Protein 




0* 


6.93 


0.00 


0.00 


0.00 




2 


0" 


7.15 


0.29 


0.06 


1.17 


2920 


3 


1 


7.14 


0.22 


0.18 


0.91 


2270 


4 


2 


6.97 


0.69 


0.07 


0.51 


1270 


5 


3 


6.91 


0.96 


0.11 


0.87 


2170 


6 


4 


7.02 


1.72 


0.09 


0.92 


2300 


7 


5 


7.40 


2.16 


0.13 


0.60 


1510 


8 


6 


7.70 


2.50 


0.11 


1.27 


3180 


9 


7 


7.83 


2.91 


0.12 


1.49 


3720 


10 


8 


7.98 


2.79 


0.09 


1.46 


3660 



*Pre-lnoculation 
**Post-lnoculation 



Example 4: Large-Scale Purification of Cloned MMLV-RT 

The enzyme was prepared as described in Example 1 above. Volumes of reage nts were adjuste d in 
propoln tothe weight oHhe pe.leted cells at the beginning of the procedure. As indicated ,n Table 5. h.gh 
purified enzyme was recovered with a 48% yield. 

Table 5 



Purification Parameters: MMLV-RT Purification Scale-Up 




Fraction 


Volume (ml) 


Protein (mg) 


Total Activity (U) 


Specific Activity (U/mg) 


Yield (%) 


Crude Lysate 


605.6 


2.1 x 10* 


1.5x 10 s 


7,100 


100 


P-11 Pool 


15.2 


741 


8.2 x10 7 


110,656 


52 


Sephacryl Pool 


79 


363 


7.9 x 10 7 


217,400 


48 



Example 5: SDS-PAGE of PurifieH MMLV-RT from 1200/pUC18N SD9D M MLV Gly Tet(-) Clone 

The progress of the purification was monitored by SDS-PAGE analysis of protein in the P-11 pool and 
50 the leohacrv pool of Example 4 above. SDS-PAGE was conducted in a 10% reducing gel essentia^ as 
2^2??n uSlli. supra. Samples were prepared as follows. An aliquot of the P-11 poo was diluted I 50- 
0 it a "l sarnie buk (50 mM Tris-HCI (pH 6.8), 10% (v/v) glycero.. 5% ^-mercaptoethano (BME* 
2% wM SDS and 0.05% (w/J) bromoheno. blue) and heated at 95«C for five minutes. An aliquot tftrorn : the 
liSy. column pool wa diluted 10-fo.d with gel sample buffer and heated ,n he same way. A sample o 
55 ^mme c Jly obtained MMLV-RT (USB, Cleveland, OH) was prepared identically. The latter sample was 
ZrteTby the supplier to have a specific activity of 187,000 U/mg and was P rov,ded in an , into 
concen'ation 0 ? 1500 U, u |. Prestained molecular weight markers (Bio Rad Laboratories. Sar , RafaeL CA 
were I ^ eTto estimate the molecular weights of the proteins contained in the sample pools. The apparent 
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molecular weigh, o, the ma*er proteins 
and is shown in Figure 10. 

Table 5 



Order of SDS-PAGE Samples in MMLV-F 


=IT Purification 


Lane 


Sample 


Volume (ul) 














3 


p-11 Pool 


2.0 


4 


P-11 Pool 


4.6 


5 


p-11 Pool 


7.0 


6 


P-11 Pool 


10.0 


7 


Sephacryl Pool 


6.5 


8 


Sephacryl Pool 


5.0 


g 


Sephacryl Pool 


4.2 


10 


Sephacryl Pool 


3.3 



Exarn p,e 6. Contaminating Ribonuclease Activity in a Com me rcial Preparation of MMLV-RT 

complementary to a region of the BNA transcript were a ethylene glycol bis (beta-amino 

lithlum chloride. 2% t»M lithium , laury, sulphate 20 ™M ^JA ^ h *t =l <impany. Milwaukee, 
ethyl ether) N. N, N 1 , N 1 tetraaceac acid f m " ™™™ '*° ut e S Three hundred ml of a solution 

Wisconsin) and the reason , «* « ^^J^ mixtljre „ as incubated 

" Jcf7nE£ Tdei; ac^umelr present on unfiled probe. The amount o, remaining 
of radiolabeled RNA by monitonng conversion from invention . other methods for 

assayed lor reverse transcriptase and RNAse activities, as describe labov* 
The resuHs of two identical column runs are shown ,n the following table. 
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Table 6. 




ac.it,. This RNAse a* »J*« »haTS*"ZZ by t <l«on ch rom a^rapt,y. a, leas, four peaks 

sr« "i. ^ ^ - ~ - a^:s 
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elutes with the MMLV-RT. 

Example 7: Comparisons^ MMLV-RT from 

E Coli Host Cells JM 109 and 1200 

the assay described in the previous example. 

The results obtained for each cell type are shown in the followmg tables. 





E. Coli Strain 1200 




Fraction 


RT Activity (RLU) 


RNAse activity 
(% degraded) 




1 


1007 


0 


20 


5 


1084 


0 




10 


1021 


0 




15 


3712 


0 


25 


20 


38359 


0 




25 


20741 


0 




30 


316513 


0 


30 


33 


346922 


0 


36 


504196 


0 




39 


387533 


0 




42 


371897 


0 


35 


45 


472248 


0 




48 


1199993 


0 




51 


1529015 


0 


40 


54 


1126592 


0 




57 


1034428 


0 




60 


850009 


0 


45 


63 


698462 


0 


66 


390121 


0 




69 


177736 


0 




72 


260049 


0 


50 


76 







55 
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Table 8 



E. Coli Strain JM 109 



Fraction 


R.T. Activity 
(RLU) 


RNAse activity 
(% degraded) 


1 


1103 


0 


5 


1238 


0 


10 


1287 


0 


15 


28359 


29 


20 


50927 


75 


25 


29551 


70 


30 


350732 


83 


35 


198151 


30 


33 


164047 


54 


,1 


149647 


66 | 




44 


161963 


62 


47 


674123 


81 


50 


2060603 


83 


53 


2703286 


85 


1 56 


1967435 





59 


1608490 


90 


62 


782936 


86 


65 


265569 


78 


68 


147948 


63 


71 


78481 


38 


74 


44426 


3 




19964 


0 


r-=- 


13900 


0 



45 The data show that the enzyme prepared from JM 109 cel.s 

RNAse H ribonuclease activity throughout the P11 column profile. Significant amount of RNAse Kbvrty 
i the -erse transcriptase activity. In contrast, the reverse • tran^nPtMa purged from t ej_cg 

1200 cells was free of detectable contaminating RNAse activity after the crude extract was ported by 
so phosphocellulose column chromatography. 

Example 8- ^n'^ *"™ rrf Mycobacterium tuberculo s is Ribosomal RNA Trngf Sequence Using Purified 

E ^± ^TV^ ^ Se Transcriptase Fror rTETcoliTgOt ^ 
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occur under stringent hybridization ^"'J''^^ mixture , and the reaction 

SSIS^ - — r a ^tS^i-on tu b es were then transferred into 
a 42'C water bath and ^^^na 46 8 ul Dilut ion Buffer to a tube and adding 1.1 u. 

^vxz e ™ s — w as ^ ad ded * e ach The 

, ^^^^ 

iSdbShrt which are incorporated * "^^^ how n ^Te 9 below indicL that saturating 
the negative control, which was run m 6 ^ a ^^ S ^ e as 2.5 fg of input template RNA at the 
levels of the amplified target sequence are obtained with as iraie as « y 
beginning of the experiment. 



Table 9 



Sensitivity of Enzyme Preparation in an Amplification Reaction 



Amount of Template RNA Added (fg) 



250 



Signal (RLU) 



2841164 



Eliam p„ 9: «-»■■*. o. cDNA u*, ******* MM! V-BT frnm E. ooH ^OO/pUdSM SD9D 

MMLV Gly Tet(-) 



reaction 



'TTE buffer was prepared by mixing 20 ml 1 M Tris-HC. (pH 7.5), 0.4 mM EDTA (pH 8.0) and 281.7 u. 
triethylamine. Primers had the following sequences: 

SEQ ID NO:8 S'-TACCTrGTrACGACTTCACCCCA-a 1 
SEQ ID NO:9 5'-CTTAGATGCTTTCAGC-3' 
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, u , a ,; th 32 P ot thnir 5' ends usinq polynucleotide kinase; procedures for end- 
The primer were ' ? » ^ a^Atte" being end labelled, the primers were purified by 

iV^ll™^ England Nuciear, according to the manulacturer's speCica- 

. ''"ns either puritied recombinant MMLV-RT trom E^b, 1 2 00,pUC18N 

5 SD9D MMLV Gly Tet(-) or reverse transcriptase ^ ur ^^^.^° 1 '^^ ^^naTvolume: Ten microliters o, GPE 

were then placed a, 60-C to, 30 minutes ^^^^ ^ added. DMA synthesis was 
, 5 ^^-0^^^^ P^rylamide gels essentia,, as 

^szszgz^^^z™ -> — ~ - iud9ed 

from the gel electrophoretograms. 
!0 example 10: Beyers. Transcriptase-M ^d PGR using Recombinant MMLV-RT .rem E. coll 
1200/PUC18N SD9D MMLV Gly Tet(-) 

All PGR reactions were run in a Perkin Elmer-Cetus Model 9600 DNA thermal cycler. The thermal 
cycle! was programmed to incubate the reaction in the fo.lowing manner and sequence. 
25 ir C yc~rkl - C for 30 seconds. 72-C for 2 minutes, and 94-0 for 1 minute; 
72 • C for 5 minutes; 

-preparations.^ 
30 commercial vendor as above. '^^^"^^^^ the experiment were as follows: 5X RT 
optimum for all enzyme preparations usecl The reageri* i used m ^t P Bu{fer E|mer) . 

Buffer (50 mM Tris HCI (pH 8-3). 75 mM ^"^S^ Premix (for each reaction) (4 ul 

Crude preparations of unspliced template RNA «ere made by suspending w«ec l 
HPV nuufeic acid sequences integrated Into their genome) «>™^^Z£^^*^ 

rpe=r,irr,o N s 

SSS S. SSSd by^rbVrer^A tra^pts ™ Prepared as indicated by me 

„ ^"1^ reecaons were conducted as toiiows. £ 

RT premi*. This mixture was ^ « ^XlT^t^ZZ^Z Z c<Sed on ice. Reyers, 
to the target nucleic acids for 10 minutes at 60 C. The -reaction mixture ^ ^ 

= ip er?s T,of ^-•zr^zzz s. — ^ ,„ ,0. a„d 

microliters of the PCR Premix was then added to each sample. The samples were p.a 
cycler at 95 -C, and cycling was performed as described above. 
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, , ™.t « described in Arnold and Nelson, supra . For each 

Hybridization and detects were earned out as des b d DNA wasTdenatured at 

hybridization assay, 30 ul of water was g.ven 10 ul ^ ^re^on ^ ^ 

95 -C lor 5 minutes. Ten microhters of d.luted p robe was added and m, ^ ^ ^ 

at 60-C for 15 minutes. Three hund red m,cro.Je so* eect.n reage ^ ^ remajnjng 

-^rri^L 1 : 5 a SSS —ed, San Diego. OA). 

The results are shown m Table 10. 



Copies of 
Template RNA 



RNA Type 



SiHa cell lysate 



Si Ha cell lysate 



E6 transcript 



E6 transcript 



E6 transcript 



E6 transcript 



E6 transcript 



E6 transcript 



E6 transcript 



E6 transcript 



E6 transcript 



Table 10 



Origin of Reverse Transcriptase 



commercial (who?) 



Average Net RLU 



E. coli 1200/PUC18N SD9D MMLV Gly Tet(-) 



commercial 



E. coli 12 00/pUC18N SD9D MMLV Gly Tet(-l 
commercial 



E. coli 1200/pUCl8N SD9D MMLV Gly Tet(-) 



commercial 



E. coli 1200/pUC18N SD9D MMLV Gly Tet(-) 



2,291 ,786 



1,317,386 



commercial 



E. coli 12 00/pUC18N SD9D MMLV Gly Tet(-) 
commercial 



E6 transcript E. coli 1200/pUC18N SD9D MMLV Gly Tet(-) 



2,283,979 



2,294,856 
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SEQUENCE LISTIMG 



(1) GENERAL INFORMATION: 



(i) APPLICANT: m 

(A) NAME: GEN- PROBE INCORPORATED 

(B) STREET: 9880 CAMPUS POINT DRIVE 

(C) CITY: SAN DIEGO 

(D) STATE: CALIFORNIA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 92121 

(G) TELEPHONE: (213) 489-1500 

(H) TELEFAX: (213) 955-0440 

(I) TELEX: 67-3510 

(ii) TITLE OF INVENTION: HIGHLY- PURIFIED RECOMBINANT REVERSE 
TRANSCRIPTASE 

(iii) NUMBER OF SEQUENCES: 18 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 
APPLICATION NUMBER: EP 95104959.2 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGCATGCAGC TGGCACGACA GGTTTCCCGA CTGGAAAGCG GGCAGTGAGC GCAACGCAAT 60 
TAATGTGAGT TAGCTCACTC ACTCATTAGG CACCCCAGGC TTTACACTTT ATGCTTCC 118 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCGTGGGTCC GAAATGTGAA ATACGAAGGC CGAGCATACA ACACACCTTA ACACTCGCCT 60 
ATTTTGAAAG TGTGTCCTTT GTCGGTACCG GTACTAATGC TTAAGCTCGA A !«■ 
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(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GATGGGTCTG AACATCGAAG ATGA 24 

{2} INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

20 

TCATCTTCGA TGTTCAGACC 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGACTCGAAT TCGTAATCAT GGCCATGGTT TAAACCTCCT TAGTGAAATT GTTATCCGCT 60 
CACAATTCCA CACAACATAC GAGCCGGAAG CATAAAGTGT AAGCCTGGGG TGCC 114 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SE(j ID NO : 5: 
AGACTCGAAT TCGTAATCAT GGCCATGGTA TTTAAACCTC CTTAGTGAAA TTGTTATCCG 60 
CTCACAATTC CACACAACAT ACGAGCCGGA AGCATAAAGT GTAAGCCTGG GGTGCC 116 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGACTCGAAT TCGTAATCAT GGCCATGGTA TATTTAAACC TCCTTAGTGA AATTGTTATC 60 
CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAGCCT GGGGTGCC 118 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AGGCAGCCAT CACAGAGACT CCAGACACCT CTACCCTCCT CTAATA 46 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGCTTATTAG AGGAGGGTAG AGGTGTCTGG AGTCTCTGTG ATGGCTGCCT TTC 5: 



(2) INFORMATION FOR SEQ ID NO: 



10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii> MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATGGGTCTGA ACATC 

(2) INFORMATION FOR SEQ ID NO: 3.1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TAAGGAGGT 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
RRTTTRR 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTAAAAACC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATGGGT 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATGGGTCTGA ACATC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 
TAAGGAGGTT TAAAAAACC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGGCAGCCAT CACAGAGACT GGAGACACCT CTACCCTCCT C 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



24 



EP 0 688 870 A1 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGCTTATTAG AGGAGGGTAG AGGTGTCTGG AGTCTCTGTG ATGGCTGCCT TTC 

( 2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

/ • • \ epnuENCE DESCRIPTION: SEQ ID NO : 1 "■ 

TTTACACTTT ATGCTTCC 

(3) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

H-n SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

™ c °— =s r= r. 

ACACTCGCCT ATTTTGAAAG TGTGTCC-i ^ ^ 

TTAAGCTCGA A 

(4) INFORMATION FOR SEQ ID NO : 3. 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 3: ^ 
GATGGGTCTG AACAT CGAAG ATGA 

(5) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE : nUCl6iC aCld 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii ) SEQUENCE DESCRIPTION: SEQ ID NO: 4: ^ 
TCATCTTCGA TGTTCAGACC 

(6) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 

(B) TYPE : ™ GleiC aCld 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO : 5 

AG ACT CGAAT TCGTAATCAT GGCCATGGTT TAAACCTCCT TAGTGAAATT 50 
GTTATCCGCT CACAATTCCA CACAACATAC GAGCCGGAAG CAT AAAGTGT1 0 0 

AAGCCTGGGG TGCC 

(7) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: H 6 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

AGACTCGAAT TCGTAATCAT GGCCATGGTA TTTAAAC CTC CTTAGTGAAA 50 
TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA AGCAT AAAGT1 1 0 
GTAAGCCTGG GGTGCC 

(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

LaCTCGAAT TCGTAATCAT GGCCATGGTA TATTTAAACC TCCTTAGTGA 50 
AATTGTT AT C CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA100 
GTGTAAGCCT GGGGTGCC 

(9) INFORMATION FOR SEQ ID NO : 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

AGGCAGCCAT CACAGAGACT CCAGACACCT CTACCCTCCT CTAATA 46 

(10) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 53 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AGCTTATTAG AGGAGGGTAG AGGTGTCTGG AGT CTCTGTG ATGGCTGCCT SO 
TTC 

(11) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 10: ^ 
ATGGGTCTGA ACATC 

(12) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 9 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TAAGGAGGT 

(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
RRTTTRR 

(14) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



55 



28 



EP 0 688 870 A1 



TTAAAAACC 

(15) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATGGGT 

(16) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: I 5 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATGGGTCTGA ACATC 

(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: I 9 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TAAGGAGGTT TAAAAAACC 

(18) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: * 6 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AGGCAGCGAT CACAGAGACT GGAGACACCT CTACCCTCCT CTAATA 

(19) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AGCTTATTAG AGGAGGGTAG AGGTGTCTGG AGTCTCTGTG ATGGCTGCCT 50 
TTC 



Claims 

1 A method for producing a polypeptide having RNA-directed and DNA-directd DNA polymerase 

promoting cell division and expression of the polypeptide gene, 

d. lysing said host cells to form a cell lysate, and 

e. purifying said polypeptide from the cell lysate. 

2. The method of Claim 1 wherein the polypeptide comprises a single covalently linked amino acid chain. 

3. The method of Claim 2 wherein the gene encoding said polypeptide comprises a nucleic acid 
sequence derived from Moloney Murine Leukemia Virus. 

4. The method of Claim 3 wherein said polypeptide has a methionine residue at an amino terminus. 

5. The method of Claim 4 wherein the polypeptide has a glycine as the penultimate residue at said amino 
terminus. 

B The method of Claim 3 wherein the nucleic acids encoding the five amino acid residues at the amino 
Sminus of saL polypeptide comprise codons to which E^pji tRNA will preferent,a.,y bind. 

T. The method of Claim 3 wherein the nucleic acids ^^^^SS^S 
terminus of said polypeptide consist of the sequence 5 -ATGGGTCTGAACATC 3 (btu 

8. The method of any of claims 1 to 7 wherein said p.asmid further comprises a ribosome binding site 7 
to 9 nucleotides upstream from the initiation codon of sa,d polypeptide gene. 

9 The method of Claim 8 wherein said ribosome binding site comprises a nucleic acid sequence 
complementary to a nucleic acid sequence of E. coli rRNA. 

10. The method of Claim 9 wherein said ribosome binding site comprises the sequence 5.-TAAGGAGGT-3' 
(SEQ ID NO:11). 

, 11. The method of any of Claims 8 to 10 wherein the ribosome binding site is 9 nucleotides upstream from 
the initiation codon of said polypeptide gene. 

12. The method of any of Claims 8 to 11 wherein a spacer ""f^^Z^Z 
binding site and said initiation codon comprises a sequence having an adenine resiaue 
5 three nucleotides upstream from the initiation codon. 

nucleic acid sequence comprising 5 -RRTTTRR-3 (SEU iu inu.i^, 
30 
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overlaps the ribosome binding site. 

T ne method o, Ciaim ,2 o, ,3 wherein ,he space, sequence c„ns,s,s o, S-TTAAAAACC.. ,SEO ,0 
N0:13). 

15 . The method o, an, o, Claims 1 * 14 wherein ,he selectable ma, k e, gene centers resistance to a, leas. 

one antibiotic to said host cell. 
16 The method of Claim 15 wherein said antibiotic comprises tetracycline. 

17 . The method of any of Cairns 1 to 16 wherein the polypeptide further is ab.e to decade an RNA strand 
in an RNA-DNA hybrid. 

,1 The method o, any o, Cairns 1 » .7 wherein said has, ce„ comprises t^enchiaco, strain 1200. 
,9 A composition comprising me polypeptide product b, ,he method ot any ol Cams 1 Id 18. 

' C ° 7S a Plurality of said hos, celis containing * recombinant poiyp.pt.da thereby terming a ceU 
lysate, *• Ko^o m^Hium in the presence of a solution having a 

SCyidO «be recovered polypeptide ,. a ge, tiltrahon column and again recovedng said poiypap- 
tide in at least one fraction, 
g, Tbe method 0. Claim 20 therein tbe RNAse-deticien, hos, ceil consists o, E^coU strain ,200. 
3S 22. The method o, Claim 20 o, 21 wherein the cation-exchange medium comprises a phosphoce.uiose 
derivative. 

«. The method e, any o, Caims 20 to 22 herein ,b. polype baa an apparen, moiecuiar weigh, o, 
about 70,000 daltons. 

25. A recombinant DNA molecule comprising: RNA -directed and DNA-directed DNA polymerase 

45 a. a DNA fragment encoding a polypept.de having RNA d.rectea ana 

bToNA fragment comprising at least one "^J^Si"' said po.ypeptide gene, and 
host cell. 

26. Th6 reoombinan, DNA e, Cm 25 wherein me poiypeptide comprises a single cova,an„y linked amino 
55 acid chain. 

27 The recombinant DNA of Cairn ,6 wherein the gene encoding said po.ypeptide comprises a nucleic 
' acid sequence derived from Moloney Murine Leukemia Virus. 
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28 The recombinant DNA of Claim 27 wherein the gene for said poiypeptide begins with the nuc.eotide 
" sequence 5'-ATGGGT-3' (SEQ ID NO:14). 

u- . hma nf a™ of Claims 25 to 28 wherein the fifteen nucleotides comprising the 5" 
. I! r 2 ^ po^de^ne cTn^risfcodons «. «** E^eK ,- R NA wil, preferential,, b,nd. 

polypeptide gene consist ol me sequence 5,-ATGGGTCTGAACA o ( 

The recombinant DNA of an, ol Cleims 25 to 30 tonne, comprising a ribosome binding site from seven 
" ^^e noTrdes upsuearn'tron, an initiation codon o, said pe„pep,ide gene. 

» The recombinant DNA o« Cm 31 herein said ribosome binding £. ^comprises a nucieic acid 
seqoence sufficiently complementer, to a nucleic acid sequence ot E. col, r RNA. 
' 5 M Toe recombinant DNA of Claim 32 wherein .be ribosome binding site comprises the seqoence 5- 
TAAGGAGGT-3- (SEO ID NO:11). 
34. The recombinant DNA o, an, o, Cairns 25 ^^^S^ZS^TS^ 

33. Tb. recombinen, DNA of Cfaim 34 herein the —^.^Z^T^T^ 
together ha,e a nucleic acid sequence comprising SEQ ID NO.12 where 
25 said nucleic acid sequence overlaps the ribosome binding stte. 

„ The recombinant DNA o. Ctaim 34 wherein the spacer seqoence sonsisu o, the nucleic acid sequence 
5--TTAAAAACC-3' (SEQ ID NO:13). 
„ 37. The recombinant DNA o, Cairn 35 wherein tne space, sequence consis* o, ore nooteic acid stance 
SEQ ID NO:13. 

-^r^rrieJa^ 
35 TAAGGAGGTTTAAAAAACC-3' (SEQ ID NO:16). 

33. The recombinant DNA 0, an, o. Calms 25 to 38 wherein the seiectable marKe, gene confers resistance 
to at least one antibiotic to said host cell. 
40 40 The recombinant DNA of Claim 39 wherein said antibiotic comprises tetraycline. 
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